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Abstract. It is an open problem to characterize the class of languages recognized by quantum finite 
automata (QFA). We examine some necessary and some sufficient conditions for a (regular) language 
to be recognizable by a QFA. For a subclass of regular languages we get a condition which is necessary 
r~Hi and sufficient. 

Also, we prove that the class of languages recognizable by a QFA is not closed under union or any other 
binary Boolean operation where both arguments are significant. 



1 Introduction 
> 

A quantum finite automaton (QFA) is a theoretical model for a quantum computer with a finite memory. 
If we compare them with their classical (non-quantum) counterparts, QFAs have both strengths and 
0^ , weaknesses. The strength of QFAs is shown by the fact that quantum automata can be exponentially more 

space efficient than deterministic or probabilistic automata | AF 9<j| . The weakness of QFAs is caused by 
the fact that any quantum process has to be reversible (unitary). This makes quantum automata unable to 
recognize some regular languages. 



o 
o 

i-^ \ The first result of this type was obtained by Kondacs and Watrous | KW 97| who showed that there is 

Q-i, 



a language that can be recognized by a deterministic finite automaton (DFA) but cannot be recognized by 
QFA. Later, Brodsky and Pippenger [ BP 99 ] generalized the construction of |KW 97 and showed that any 
regular language that does not satisfy the partial order condition cannot be recognized by a QFA. They also 
conjectured that all regular languages satisfying the partial order condition can be recognized by a QFA. 

In this paper, we disprove their conjecture. We show that, for a language to be recognizable by a 1-way 
QFA, its minimal deterministic automaton must not contain several "forbidden fragments" . One of these 
fragments is equivalent to the automaton not satisfying the partial order condition. The other fragments are 
r> ' new. 

A somewhat surprising feature of our "forbidden fragments" is that they consist of several parts (corre- 
sponding to different beginnings of the word) and the language corresponding to every one of them can be 
recognized but one cannot simultaneously recognize the whole language without violating unitarity. 

Our result implies that the set of languages recognizable by QFAs is not closed under union. In particular, 
if we consider the language consisting of all words in the alphabet {a, b} that have an even number of a's after 
the first b, this language is not recognizable by a QFA, although it is a union of two recognizable languages. 
(The first language consists of all words with an even number of a's before the first b and an even number 
of a's after the first b, the second language consists of all words with an odd number of a's before the first b 



and an even number of a's after it.) This answers a question of Brodsky and Pippenger | BP 9£ |. 

For a subclass of regular languages (languages that do not contain "two cycles in a row" construction 
shown in Fig. g), we show that our conditions are necessary and sufficient for a language to be recognizable 
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by a QFA. For arbitrary regular languages, we only know that these conditions are necessary but we do not 
know if all languages satisfying them can be recognized by a QFA. 

1.1 Definitions 



Quantum finite automata (QFA) were introduced independently by Moore and Crutchficld [MC 97 1 and 



Kondacs and Watrous [KW 97]. In this paper, we consider the more general definition of QFAs [KW 97] 



(which includes the definition of [MC 97 1 as a special case) 



Definition 1.1. A QFA is a tuple M = {Q; S]V;qo]Q a cc',Qrej) where Q is a finite set of states, U is an 
input alphabet, V is a transition function, qo^Q is a starting state, and Q acc C Q and Q re j C Q are sets 
of accepting and rejecting states (Q aC c H Qrej = %)■ The states in Q acc and Qrej, are called halting states 
and the states in Q non = Q — (Qacc U Qrej) are called non halting states, k and $ are symbols that do not 
belong to S. We use k and $ as the left and the right endmarker, respectively. The working alphabet of M 
is r= £U {«;$}. 

The state of M can be any superposition of states in Q (i. e., any linear combination of them with complex 
coefficients). We use \q) to denote the superposition consisting of state q only. ^(Q) denotes the linear space 
consisting of all superpositions, with ^-distance on this linear space. 

The transition function V is a mapping from T x ^(Q) t° h{Q) such that, for every a£T, the function 
V a : h(Q) ~> h(Q) defined by V a (x) = V(a, x) is a unitary transformation (a linear transformation on hiQ) 
that preserves I2 norm). 

The computation of a QFA starts in the superposition \q$). Then transformations corresponding to the 
left endmarker k, the letters of the input word x and the right endmarker $ are applied. The transformation 
corresponding to a€T consists of two steps. 

1. First, V a is applied. The new superposition ip' is V a (tp) where ip is the superposition before this step. 

2. Then, ip' is observed with respect to E a cc,E r ej,E non where E acc — span{\q) : q<EQ a cc}, E re j = 
span{\q) : q^Qrej}, E non — span{\q) : q<EQ n on}. It means that if the system's state before the measurement 
was 

qt&Qacc qj^Qrej qk&Qnon 

then the measurement accepts ip' with probability Sal, rejects with probability £[3? and continues the 
computation (applies transformations corresponding to next letters) with probability ^7^ with the system 
having state ip = S^k \qk}- 

We regard these two transformations as reading a letter a. We use V' a to denote the transformation 
consisting of V a followed by projection to E non . This is the transformation mapping ip to the non- halting 
part of V a (ip). We use V^ to denote the product of transformations V^ = V'a^V'a n _ 1 ■ ■ ■ V^V^, where a, is 
the ?-th letter of the word w. We also use tp y to denote the non-halting part of QFA's state after reading the 
left endmarker n and the word y£S* . From the notation it follows that ip w = V{. w {\qo))- 

We will say that an automaton recognizes a language L with probability p (p > h) if it accepts any word 
xGL with probability > p and rejects any word x^L with probability > p. 

1.2 Previous work 

The previous work on 1-way quantum finite automata (QFAs) has mainly considered 3 questions: 

1. What is the class of languages recognized by QFAs? 

2. What accepting probabilities can be achieved? 

3. How does the size of QFAs (the number of states) compare to the size of deterministic (probabilistic) 
automata? 



In this paper, we consider the first question. The first results in this direction were obtained by Kondacs 
and Watrous 0KW 97|] . 



Theorem 1.1. \KW 9% 



1. All languages recognized by 1-way QFAs are regular. 

2. There is a regular language that cannot be recognized by a 1-way QFA with probability ^ + e f or an V £ > 0. 



Brodsky and Pippenger [BP 99 1 generalized the second part of Theorem 1 .1 by showing that any language 
satisfying a certain property is not recognizable by a QFA. 



Theorem 1.2. fiP 9£j Let L be a language and M be its minimal automaton (the smallest DFA recognizing 
L). Assume that there is a word x such that M contains states q±, qi satisfying: 

1- <7i 7^ 12, 

2. If M starts in the state q\ and reads x, it passes to q2, 

3. If M starts in the state qi and reads x, it passes to q-z, and 

4-. There is a word y such that if M starts in qi and reads y, it passes to q\, 

then L cannot be recognized by any 1-way quantum finite automaton (Figxnj. 




Fig. 1. Conditions of theorem 1.2 



A language L with the minimal automaton not containing a fragment of Theorem 1.2 is called satisfy- 
ing the partial order condition [MT 6£]. [ BP 9S( | conjectured that any language satisfying the partial order 
condition is recognizable by a 1-way QFA. In this paper, we disprove this conjecture. 

Another direction of research is studying the accepting probabilities of QFAs. This direction started with 
Ambainis and Freivalds | AF 9Sf| showing that the language a* b* is recognizable by a QFA with probability 
0.68... but not with probability 7/9 + e for any e > 0. This showed that the classes of langua ges recogn izable 
with different probabilities are different. Next results in this direction were obtained by [ABFK99] who 
studied the probabilities with which the languages a\ . . .a* can be recognized. 

There is also a lot of results about the number of states needed for QFA to recognize different languages. In 
some cases, it can be exponentially less than for deterministic or even for probabi listic automata |AF 98 K ! 



In other cases, it can be exponentially bigger than for deterministic automata |ANTV 9S,N 
A good survey about quantum automata is Gruska [G 00] 



2 Main results 

2.1 Necessary condition 

First, we give the new condition which implies that the language is not recognizable by a QFA. Similarly to 
the previous condition (Theorems 1.2), it can be formulated as a condition about the minimal deterministic 
automaton of a language. 



Theorem 2.1. Let L be a language. Assume that there are words x, y, z\, zi such that its minimal automaton 
M contains states q\, qi, 93 satisfying: 

1- 92 ¥= 93, 

2. if M starts in the state q\ and reads x, it passes to q<x, 

3. if M starts in the state qi and reads x, it passes to 02, 
4- if M starts in the state q\ and reads y, it passes to #3, 

5. if M starts in the state q% and reads y, it passes to q%, 

6. for all words t E (x\y)* there exists a word t\ E (x\y)* such that if M starts in the state qi and reads 
it passes to q2, 

7. for all words t E (x\y)* there exists a word t\ E (x\y)* such that if M starts in the state q% and reads 
it passes to q%, 

8. if M starts in the state qi and reads z\ , it passes to an accepting state, 

9. if M starts in the state qi and reads z<i, it passes to a rejecting state, 

10. if M starts in the state qs and reads z\, it passes to a rejecting state, 

11. if M starts in the state q% and reads Z2, it passes to an accepting state. 
Then L cannot be recognized by a 1-way QFA. 



tt 



tt 




Fig. 2. Conditions of theorem 2.1, conditions 6 and 7 are shown symbolically 



Proof. We use a lemma from [BV 97 



Lemma 2.1. If ip and (f> are two quantum states and \\ip — <fi\\ < e then the total variational distance between 
the probability distributions generated by the same measurement on ip and <fi is at mos^j 2e. 



We also use a lemma from [ AF 9£ ] . 
Lemma 2.2. Let x E S + . There are subspaces E\, E2 such that E n 



Ex © E 2 and 



(1) If ill £ Ei, then V^(ip) E E x and \VJ$)\ = ||^||, 
(ii) If if) E E 2 , then ||V' fc (^i)|| -+ when k -* 00. 



1 The lemma in [BV 97 has 4e but it can be improved to 2e. 



Lemma 2.2 can be viewed as a quantum counterpart of the classification of states for Markov chains 
[KS 76]. The classification of states divides the states of a Markov chain into ergodic sets and transient sets. 
If the Markov chain is in an ergodic set, it never leaves it. If it is in a transient set, it leaves it with probability 
1 — e for an arbitrary e > after sufficiently many steps. 

In the quantum case, E\ is the counterpart of an ergodic set: if the quantum random process defined by 
repeated reading of a; is in a state ip G E\, it stays in E\. Ei is a counterpart of a transient set: if the state is 
ip G E2, E-2 is left (for an accepting or rejecting state) with probability arbitrarily close to 1 after sufficiently 
many x's. 

The next Lemma is our generalization of Lemma [2.2| for the case of two different words x and y. 

Lemma 2.3. Let x,y E ZJ + . There are subspaces E\, E2 such that E non — E\ E2 and 

(1) Ifi\)E E u then V£(<ip) € E x and V y (ip) G E x and \\V^)\\ = \\ip\\ and \\V^)\\ = 
(ii) If ip G E2, then for any e > 0, there exists a word t G (x\y)* such that \\Vf(ip)\\ < e 



Proof. We use E\ to denote the space E\ from Lemma 2.2 for a word z. We define E\ — P| E\. E2 

ze(x\y)* 

consists of all vectors in E non orthogonal to E\. Next, we check that both (i) and (ii) are true. 

(i) It is easy to see that, for all t G {x\y)* , \\Vl{4>)\\ = \\ip\\ due to ip G E\. 

We also need to prove that V x (ip) G E\ and Vy(ip) E E\. For a contradiction, assume there are ip G I?i 
and £1 G (a;|y)* such that VJ' (V>) ^ £i- Then, by definition of E\, there also exists t 2 G (^|y)* such that 
VI (ip) does not belong to E[ 2 . Lemma |2^ implies that the norm of VJ (ip) can be decreased by repeated 
applications of V( . A contradiction with ||V r /(^)|| = ||^|| for all t. 

(ii) Clearly, if ip belongs to E 2 then for all t G (x\y)* the superposition V t '(ip) also belongs to E 2 because 
V x and V v are unitary and map E\ to itself (and, therefore, any vector orthogonal to E\ is mapped to a 
vector orthogonal to E\). 

\\Vl(ip)\\ does not increase if we extend the word t to the right and it is bounded from below by 0. Hence, 
for any fixed e we can find a t G (x\y)* such that 

\\v{m-\\vM)\\<* 

for all w G (x\y)* ■ We define a sequence of such words ti,t 2 ,t 3 , ... for e, |, |, Vl^ip), V/ 2 (ip), V t ' (ip), . . . 

is a bounded sequence in a finite dimensional space. Therefore, it has a limit point ip' . We will show that 
ip' = 0. 

First, notice that ip' G E% because it is a limit of a subsequence of V( (ip),V( (ip),V( (ip), . . . and all 
V(.(ip) belong to E^- Therefore, if ip' ^ then ip' has nonzero E\ component for some z G (x\y)* . Reading 
sufficiently many z would decrease this component, decreasing the norm of ip' . 

This contradicts the fact that, for any w, \\ip'\\ — \\V^(ip')\\ (since ll^'ll — || 1 1C(V'')II ^ ^ ess than any e > 
which is true because ip' is the limit of V{ t (ip), V/ 2 (ip), V( 3 (ip), . . .). 

Therefore, ip' = 0. This completes the proof of lemma. □ 

Let L be a language such that its minimal automaton M contains the "forbidden construction" and M q 
be a QFA. We show that M q does not recognize L. 

Let w be a word after reading which M is in the state q\. Let ip w — ipl, + ip%,, ipl, G E\, ip%, G E 2 . We 
find a word a G (x\y)* such that after reading xa M is in the state 02 and the norm of ip1, xa = V^(ip^, x ) is at 
most some fixed e > 0. (Such word exists due to Lemma |2.3| and conditions 6 and 7.) We also find a word b 
such that \\ipl yb \\ <e. 



Because of unitarity of V' x and Vy on E\ (part (i) of Lemma |2.3| ) , there exist integers i and j such that 
K ixa y - V-4II < e and ||^ (y6)J - Vill < e. ' 



Let p be the probability of M q accepting while reading kw. Let p\ be the probability of accepting while 
reading (xa)' 1 with a starting state ip w , P2 be the probability of accepting while reading (yby with a starting 
state ip w and p%, p^ be the probabilities of accepting while reading z\§ and Z2$ with a starting state ip^. 

Let us consider four words K,w(xa)' l Zi%, Kw(xa) l Z2§, Kw(yb) J Zi$, Kw(yb) J z 2 $- 

Lemma 2.4. M q accepts kw(xo)' 1 z\% with probability at least p + pi +P3 — 4e and ai mosi p + pi +P3 + 4e. 

Proof. The probability of accepting while reading kw is p. After that, M q is in the state tp w and reading 
(xa) 1 in this state causes it to accept with probability p\. 

The remaining state is ip w ( xa )i = V'ifxaV +4'w(xa) i - ^ ^ was V'ii the probability of accepting while reading 
the rest of the word (zi$) would be exactly p$. It is not quite ipw but it is close to i/jw- Namely, we have 



Uv 



{xa,y 



1>l\\< 



w{xay 



II + Ui 



w(xa) z 



Vill<e + e = 2e. 



By Lemma 2.1, this means that the probability of accepting during zi§ is between p 3 — 4e and p 3 + 4e 



D 
Pi - 4e and p + p\ + p± + 4e. 
\-P2 +P3 + 4e. On the fourth 



Similarly, on the second word M q accepts with probability between p + p\ H 
On the third word M g accepts with probability between p + p2 + p% — 4e and p 
word M q accepts with probability p + P2 + p± — 4e and p + P2 + Pi + 4e. 

This means that the sum of accepting probabilities of two words that belong to L (the first and the fourth 
words) differs from the sum of accepting probabilities of two words that do not belong to L (the second and 
the third) by at most 16e. Hence, the probability of correct answer of M q on one of these words is at most 
\ + 4e. Since such 4 words can be constructed for arbitrarily small e, this means that M q does not recognize 



L. 



D 



2.2 Necessary and sufficient condition 

For languages whose minimal automaton does not contain the construction of Figure 0, this condition 



(together with Theorem 1.2) is necessary and sufficient 



Theorem 2.2. Let U be the class of languages whose minimal automaton does not contain "two cycles in 
a row" (Fig. |jj. A language that belongs to U can be recognized by a 1-way QFA if and only if its minimal 
deterministic automaton doe s no t contain the "forbidden construction" from Theorem Li and the "forbidden 
construction" from Theorem 2.1. 




X 




qi) x Jv 92 ^y ,93 




Fig. 3. Conditions of theorem 2.1 



Proof. Let M be the minimal deterministic automaton of a language L. If it contains at least one of " forbidden 
constructions" of Theorems 1.2 and 2.1, then L cannot be recognized by a 1-way QFA. We now show that, 
if M does not contain any of the two "forbidden constructions" and does not contain "two cycles in a row" 
construction then L can be recognized by a QFA. 

Let go be the starting state of M and V be the transition function of the automaton M . V(q, x) denotes 
the state to which M goes if it reads the word x in the state q. 



We will construct a QFA for L by splitting M into pieces A, B\, . . . , B n , constructing a reversible finite 
automaton for each of those pieces and then combining these reversible automata. 

Let B be the set of all states q such that after reading any word in q, there exists a word such that M 
passes back to the state q. We split B into connected components B\ , B2 , . . . , B n . Two different states qi 
and qj belong to the same Bk iff q% is reachable from qj and qj is reachable from qt . Let A be the set of all 
remaining states, i. e., the states that do not belong to B = B\ U B2 U . . . U B n . 




Fig. 4. Division of M 



Lemma 2.5. For every letter a and every state q of Bi, there is exactly one q' G Bi such that reading a in 
q' leads to q, i. e., every letter induces a permutation of states in Bi. 

Proof. Let q be a state in Bi and a G S. For a contradiction, assume that there are two states q' G Bi and 
q" G B t such that V(q', a) = V(q", a) = q. 

Then, there exist words x' and x" such that V(q, x") = q" and V(g", a;') = q' . (This is true because every 
state in Bi is reachable from every other state in Bi.) 

However, this means that Bi contains the "forbidden construction" of Theorem L2. with qi — q' , q2 = q" , 
x = ax" and y = x' . A contradiction. □ 

Such automata Bi are called permutation automata. 




X 



<1\ ! ( 92 




Fig. 5. Conditions of Lemma 2.6 



Lemma 2.6. A does not contain a fragment of the form shown in Fig. [J with qi and q^ being two different 
states of A and x 6 S + . 

Proof. For a contradiction, assume that A contains such a fragment. 

By definition of A and B, qi G A implies that there is a word z such that reading z in qi leads to a state 
q' 2 and qi is not reachable from q' 2 . 



Consider the states V(q%, z), V(q2, z 2 ), .... M has a finite number of states. Therefore, there must be j 
and k such that V(q 2 , z ] ) = V(q 2 , z ]+k ). Notice that this implies V(q 2 ,z°') = V(q 2 , z ] ' +k ) for all j' > j. 

Let i be the smallest number such that i > j and i is divisible by k. Define q^ — V(q 2l z l ). Then, 
V(q 2 , Z*) = V(q 2 , z i+k ) = V(q 2 , z l+2k ) = . . . = V(q 2 , z 2% ) (because i is divisible by k), i. e., q 3 = V(q 3l z i ). 

We have shown that qi, q 2 , q 3 form a "two cycles in a row" construction with y = z l . A contradiction. □ 



By a theorem from | AF 9q| , any language recognizable by a deterministic automaton which does not 



contain the construction of Fig. [^ is recognizable by a reversible finite automaton(RFA). (A reversible finite 
automaton is a deterministic automaton in which, for every state q and letter a, there is at most one state 
q' such that reading a in q' leads to q.) 

Any reversible automaton is a special case of a quantum automaton. (If, for every state q and every letter 
a, there is one q' such that reading a leads to q, the letter a induces a permutation on states of automaton 
and the corresponding transformation of a quantum automaton is clearly unitary.) 

Therefore, the language recognized by A is recognized by a QFA as well. Also, permutation automata 
B\, . . . , B n are special cases of reversible automata. Therefore, they can be replaced by equivalent QFAs. 
We will construct a QFA for L by combining those QFAs. 

However, before that, we must solve one problem. Even if the state of M after reading a word x is in 
Bi, the starting state of M can be in A. If we want to use the permutation automaton for Bj to recognize 
a part of L, we must define one of states in Bi as the starting state. The next two lemmas show that this is 
possible. 

Lemma 2.7. If the minimal automaton M contains the construction of Fig. 0, the states q 2 and q 3 cannot 
be in the same Bi. 

Proof. Let us suppose the opposite. 

q 2 and (73 are different states of the minimal deterministic automaton. Therefore, there exists a word a 
such that V{q 2 , a) is an accepting state (or a rejecting state) and V(q 3 , a) is a rejecting state (or an accepting 
state). 

Also, there is no word I such that V(q 2 ,l) is a rejecting state (or an accepting state) and V(q3,l) is an 
accepting state (or a rejecting state) because, otherwise, M would contain the construction of Theorem [O. 

We denote V(q 2 , a) by q aC c and V(q3, a) by q re j- There exists a word b such that V(q re j, b) — q acc (because 
all states in Bi can be reached from one another). Moreover, the states V(q 2 ,ab) and V(qs, ab) are accepting 
states (V(q3,ab) is accepting because V(q3,ab) = V(q re j,b) = q acc and V(q 2l ab) is accepting because, if 
it was rejecting, qi, q 2 and 93 would form the construction of Theorem [j.l| with a and ab as z\ and z 2 ). 
Similarly, the states V{q 2 , abb) and V{q3, abb) are accepting states, the states V(q 2 , abbb) and V(q3, abbb) are 
accepting states and so on. However, there exists k such that V(q3, ab k ) = V(q3,a) = q re j (because Bi is a 
permutation automaton and, therefore, it must return to the starting state after some number of 6's) . This 
gives us the contradiction. □ 



Lemma 2.8. For each part Bi, there is a state q such that ifV(qo,x) belongs to Bi then V(qo,x) — V(q,x). 

Proof. Let V(qo,x) G Bi. Then, there is a unique q G Bi such that V(qo,x) — V(q,x). (This is true because 
Bi is a permutation automaton and every state has a unique preceding state.) We must show that this state 
q does not depend on the word x. 

Assume this is not true. Then, there are words x and y such that V{qo,x) — V(q 2l x) and V(qo,y) — 
V(q 3 ,y) and q 2 ^ q 3 (and V{q 2 ,x), q 2 , V(q 3 ,y), q 3 are all in Bi). 

Let ?4 = V(q 2 ,x) and q$ = V(q3,y). Then, there exist j G IN and k G IN such that V(q4,x : >) = q^ and 
V{qbi1) k ) — 15- (Again, we are using the fact that Bi is a permutation automaton, and, therefore, if it reads 
the same word many times, it returns to the same state at some point.) 




Fig. 6. 



This implies V(q4,x : ' x ) = q 2 and V(qs,y k 1 ) = q% (because V(q2,x) = 04 and V(q3,y) — (75 and, in a 
permutation automaton, q such that V(q,x) = 04 must be unique). Therefore, V(qo,x 3 ) — V(q4,x : '^ 1 ) = q 2 
and V{q 2 ,x : >) = Viq^x^ 1 ) = q 2 . Similarly, V(q ,y k ) = V(q 3 ,y k ) = g 3 . 

This means that Bi contains the states q 2 and (73 fr om t he construction shown in Figure |6| (with x^ and 



y instead of x and y and go instead of gi). By Lemma |2.7| , this is impossible. A contradiction. □ 

We denote these states as qi. Let B[ be the automaton Bi with gj as the starting state. Let Li be the 
language recognized by B[. 

Lemma 2.9. For any i,j G {1, . . . , n}, either Li C Lj or ly C L^. 

Proof. Let a; and y be such that V(qo,x) = qi and V(go,y) = gj ■ (x and y exist because, otherwise, #, or qj 
would be unreachable from the starting state qo.) By Lemma 2.8 , V{qi,x) = qi and V(qj,y) = qj. 

For a contradiction, assume that neither Li C Lj nor Lj C Li is tru e. T hen, there are words z\ G Li — Lj 
and z 2 £ Lj — Li and we get the "forbidden construction" of Theorem 2.1. □ 



Let ai denote the number of j G { 1 , . . . , n} such that Lj C Li. A' denotes the corresponding reversible 
automaton for the automaton A with one modification: when the automaton M passes to a state of B\ U 
B 2 U . . . U B n A' accepts with probability ^~_° i and rejects with probability 2i rj ; - 



Next, we define a QFA recognizing the language L: it works as A' with probability p - ,, 
amplitude 



rt+l 



(with 



n+l 



j) and as Bi with probability 2 n+i ( w ith amplitude 



for each i. 



2n+l' * *- ' 2n+l v «" V2n+1' 

Case i. V(go,^) G -4- The QFA recognizes x with probability p. 

Case 2. V(qo,x) G Bi and x E L. The automaton A' accepts with probability n ~' ^ . Moreover, x is 

accepted by at least aj automata from SJ, _B 2 , • ■ ■ , -B^- This means that the total probability of accepting is 

at least 

n — a,i n+l a, + 1 n+l 

n + 1 ' 2n + l 2?i + l ~ 2n + 1 ~ P ' 

Case 3. V{qa,x) G Bi and x ^ L. Similarly to the previous case, the total probability of rejecting is at 
least p. □ 



3 Non-closure under union 
3.1 Non-closure result 



In particular, Theorem 2.1 implies that the class of languages recognized by QFAs is not closed under union. 

Let L\ be the language consisting of all words that start with any number of letters a and after first 
letter b (if there is one) there is an odd number of letters a. Its minimal automaton G\ is shown in FigM 

This language satisfies the conditions of Theorem 2.1. (q±, qi and 53 of Theorem 2.1 are just qi, 02 and 
53 of G\. x, y, Z\ and z 2 are 6, aba, ab and 6.) Hence, it cannot be recognized by a QFA. 




Fig. 7. Automaton G\ 



Consider 2 other languages Li and L 3 denned as follows. 

L2 consists of all words which start with an even number of letters a and after first letter b (if there is 
one) there is an odd number of letters a. 

L3 consists of all words which start with an odd number of letters a and after first letter b (if there is 
one) there is an odd number of letters a. 

It is easy to see that L\ — L 2 {JL 3 . 

The minimal automatons G2 and G3 are shown in Fig.|8| and Fig.|9j. They do not contain any of the 
"forbidden constructions" of Theorem 2.2. Therefore, L2 and L3 can be recognized by a QFA and we get 



Theorem 3.1. There are two languages L2 and L3 which are recognizable by a QFA but the union of them 
L\ = L2U L3 is not recognizable by a QFA. 



Corollary 3.1. The class of languages recognizable by a QFA is not closed under union. 



This answers a question of Brodsky and Pippenger [BP 99]. 

As L 2 Pi £3 = then also L\ = L2AL3. So the class of languages recognizable by QFA is not closed 
under symmetric difference. From this and from the fact that this class is closed under complement, it easily 
follows: 



Corollary 3.2. The class of languages recognizable by a QFA is not closed under any binary boolean oper- 
ation where both arguments are significant. 



3.2 Another construction of QFAs 



Instead of using the general construction of Theorem 2.2, we can also use a construction specific to languages 



L2 and L3. This gives simpler QFAs and achieves a better probability of correct answer. (Theorem 2.2 
gives QFAs for L2 and L3 with the probability of correct answer 3/5. Our construction below achieves the 
probability of correct answer 2/3.) 

We construct two quantum automata K 2 and K 3 which recognize languages L 2 and L 3 . Like G 2 and G3 
they differ only in a starting state. 





Fig. 8. Automaton G 2 



Fig. 9. Automaton G3 



The automaton K 2 consists of 8 states: 91, q 2 , 93, 94, 95, 96, 97, 98, where <5„ 
{95,9s}, Qrej = {96, 97}- The unitary transform matrices V K , 14, VJ, and V$ are: 



{91,92,93,94}, Q 
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The starting state is q\ for K 2 and q^ for K3 
The state 91 in G 2 corresponds to tpi - 

The state 92 in G 2 corresponds to ip 2 - 

The state 93 in G2 corresponds to ip 3 ■■ 

The state 94 in G 2 corresponds to ^4 = 




00 
01/ 

Next 



,V $ = 



/00010000\ 
00100000 
01000000 
10000000 
00001000 
00000100 
00000010 

yoooooooi/ 



/00001000\ 
00000100 
00000001 
00000010 
10000000 
01000000 
00010000 

\00100000y 



we show that K 2 works similarly to G2. 
I9i) + y / jl92) mK 2 . 
\q 2 ) in K 2 . 
\q 3 ) in K 2 . 

I? 4 ) + V I I* 3 ) in K 2- 

1. After reading the left endmarker k K 2 is in the state VJ^qi)) — i'l- G 2 is in its starting state 91. 

2. If by reading the letter a the automaton G 2 passes from qi to 94 or back then the state of K 2 changes 
from ipi to ip4 or back. 



3. If K 2 receives the letter b in the state -04 then it rejects the input with probability |. This is correct 
because G 2 passes from 94 to the "all rejecting" state q$. 

4. If G 2 receives the letter b in the state q\ it passes to q%. If K 2 receives the letter b in the state tpx, it passes 
to the state -4= \q2) + —7= | (75) + -4= \qo) and after the measurement accepts the input with probability |, 

rejects the input with the same probability 4, or continues in the state ip 2 . 

5. If by reading the letter a the automaton G 2 passes from q 2 to q-$ or back then the state of K 2 changes 
from i\)i to tp3 or back. By reading the letter b G 2 passes from q 2 to q 2 and from (73 to q%. K 2 passes from 
ip 2 to ip2 and from 1JJ3 to 1^3. 

6. If K 2 receives the right endmarker in state ipi then the input is accepted with probability |. 

7. If K 2 receives the right endmarker in state tp2 then the input is rejected with probability i and as it was 
rejected with the same probability so far, the total probability to reject the input is |. 

8. If K 2 receives the right endmarker in state ips then the input is accepted with probability -| and as it 
was accepted with the same probability so far the total probability to accept the input is 3. 

9. If K 2 receives the right endmarker in state ip4 then the input is rejected with probability |. 

This shows that, whenever G 2 is in a state qi (i 6 {1, 2, 3, 4}), K 2 is in the corresponding state ipi. Also, 
this shows that K 2 accepts input with probability g iff it receives right endmarker $ in one of states ipi or 
?/>3 corresponding to q\ and (73, the only accepting states in G\. So, we can conclude that K 2 accepts the 
language L 2 with probability |. Similarly, we can show that K 3 accepts L 3 with probability |. 

Thus, we have shown 

Theorem 3.2. There are two languages L 2 and L3 which are recognizable by a QFA with probability | but 
the union of them, L\ = L 2 1J £3 is not recognizable with a QFA (with any probability 1/2 + e, e > OJ. 

3.3 On accepting probabilities 



The probabilities for L 2 and L3 achieved in Theorem 3.2 are the best possible, as shown by the following 
theorem. 

Theorem 3.3. If 2 languages L\ and L 2 are recognizable by a QFA with probabilities p\ andp 2 and — + — < 
3 then L = L\ 1J L 2 is also recognizable by QFA with probability — , Pl1 \ 2 — . 

Proof. We have a QFA K\ which accepts L\ with probability p\ and a QFA K 2 which accepts L 2 with 
probability p 2 . We will make a QFA K which will work like this: 

P2 



1. Runs K\ with probability 

2. Runs K 2 with probability 

3. Accepts input with probability 



P1+P2+P1P2 ' 
Pi 



P1+P2+P1P2 ' 

1. wELi and w<EL 2 — > input is accepted with probability 



V2 , Pi VW2 1 3pip 2 

• Pi H ; ; • P2 H ■ ■ 1 = 



P1+P2+ P1P2 P1+P2+ P1P2 P1+P2+ P1P2 P1+P2+ P1P2 

2. w£Li and w^L 2 — > input is accepted with probability at least 

P2 P1P2 1 2pip 2 

• Pi H ; ; 1 = 



P1+P2+ P1P2 P1+P2+ P1P2 P1+P2+ P1P2 

3. w^Jj\ and w£L 2 — > input is accepted with probability at least 

Pi P1P2 1 %PlP2 

■ P2 H ; ; 1 = 



P1+P2+P1P2 P1+P2+P1P2 P1+P2+P1P2 



4. wtfiLi and w^L 2 — ► input is rejected with probability at least 

P2 Pi 2piP2 

■ Pi H ; ; • P2 



Pi +P2+ P1P2 P1+P2+ P1P2 ' Pi +P2+ P1P2 

So automaton K recognizes L with the probability at least 

2 Pl p 2 __ 1 | 3-(^- + £) ^ 1 



P1+P2+P1P2 2 4(1 + ^ + ^) 2 

D 

All this has also a nice geometric interpretation. We are going to build a linear function / from probabili- 
ties xi, x 2 to probability x such that f(pi,Pi) > 5+e, /(pi)O) > ^+e, f(0,p 2 ) > |+£, /(l— pi,l— P2) < 5— £■ 
Geometrically we consider a plane x, y where each word w is located in a point (x, y), where x is probability 
that i^i accepts w and y is the probability, that K 2 accepts w. 

S\ is the place where lies all words that do not belong to L. S 2 is the place where lies all words that 
belong to L. 

If we can (Fig.|12[) separate these two parts with a line ax + by = c then we can construct automaton 
"K = aK\ + bK 2 " with c as isolated cut point. If we can not (Fig.[13|) then this method doesn't help. And 
as it was shown higher sometimes none of other methods can help, too. 

Case when pi = p% = | (FigJIJ) is the limit case. If any of probabilities were a little bit greater then this 
method would help. 

Sometimes it may be that there are no words w such that K\ or K 2 would reject with probability 1 — t 
or greater. Then (Fig.^5J) you can see that now it is easier to make such a line so the condition — + — < 3 
can be weakened (the probabilities in Fig.|l5| are the same as in Fig.oh. In the limit case when rejecting 
probabilities are only pi and p 2 , Si is the point (1 — pi,l — p 2 ) (Fig.|l6|). So with any pi and p 2 you can 
separate S\ from S2 with a line, from what it follows you can always construct K — K\ (J K 2 . 

Now it is clear that the languages L 2 and L3 defined in section 3 cannot be recognized with probability 
greater than | so the construction presented there is best possible. 

Corollary 3.3. If 2 languages L\ and L 2 are recognizable by a QFA with probabilities pi andp 2 andpi > 2/3 
and p 2 > 2/3, then L = L\ (J L 2 is recognizable by QFA with probability p% > 1/2. 



Proof. Follows immediately from Theorem p.3| . D 

4 More "forbidden" constructions 



If we allow the "two cycles in a row" construction, Theorem 2.2 is not longer true. More and more complicated 



"forbidden fragments" that imply non-recognizability by a QFA are possible. 

Theorem 4.1. Let L be a language and M be its minimal automaton. If M contains a fragment of the form 
shown in Figure |7^ where a,b,c,d,e, f,g,h,i £ S* are words and q Q , q a , q b , q c , q ad , q ae , q bd , qbf, q ce , q c f 
are states of M and 

1. If M reads x € {a,b,c\ in the state qo, its state changes to q x . 

2. If M reads x G {a,b,c} in the state q x , its state again becomes q x . 

3. If M reads any string consisting of a, b and c in a state q x (x G {a, b, c}), it moves to a state from which 
it can return to the same state q x by reading some (possibly, different) string consisting of a, b and c. 

4- If M reads y G {d, e, /} in the state q x (x G {a, b, c}), it moves to the state q X yQ 



2 Note: we do not have this constraint (and the next two constraints) for pairs x — a,y — f, x — b, y = e and x = c, 
y — d for which the state q xy is not defined. 



5. If M reads y G {a,b,c} in a state q xy , its state again becomes q xy . 

6. If M reads any string consisting of d, e and f in the state q xy it moves to a state from which it can 
return to the same state q xy by reading some (possibly, different) string consisting of d, e and f . 

7. Reading g in the state q a d, h in the state qbf and i in the state q ce leads to accepting states. Reading h 
in the state q ae , i in the state qtd, g in the state q c f leads to rejecting states. 

then L is not recognizable by a QFA. 




Fig. 10. Conditions of theorem 4.1 



Proof. For a contradiction, assume that Q is a QFA that recognizes L with a probability 1/2+e. We construct 
6 words such that Q gives a wrong answer on at least one of them. 

Let be a superposition of QFA corresponding to the state qo (the superposition after reading some 



word w that leads to go)- Similarly to the proof of Theorem 2.1, we consider decompositions E non = Ef(BE% 
for all x £ {a, b, c}* and take E\ = C\ x Ef , E 2 — E non — E\. 



Let = ipi + 02, 01 G Ei, -02 6 Ei. Similarly to the proof of Theorem 2J, there is a word a! <E {a, b, c} 
with the first letter a such that HV^/^i) — V'lll — ^ an d IIKMV^)!! < 8 where 5 = e/20. Also, there are words 
b' and d with the first letters b and c and the same property. 

Next, we consider the decompositions E non — Ef © J5f for all x £ {d,e,f}* and take E 3 — H X E^, 
E^ = E non — E A . Let 0i = -03 + ^4, 03 e E 3 , 04 6 JS4. Let rf', e', /' G {d, e, /}* be words with the first letters 
d, e and / such that ||V^',(03) — 03 1| < 6 and ||V^,(04)|| < 8 (and similar inequalities hold for e' and /'). 

Let po be the probability of accepting while reading the left endmarker n and the word w that leads 
to the superposition 0o- Let p ai pi,, p c be the probabilities of accepting while reading a', b' and d if the 
starting superposition is 0. Let pd, p e and pf be the probabilities of accepting while reading a', b' and c' if 
the starting superposition is 0i. Let p g , pu and pi be the probabilities of accepting while reading g$, h$ and 
i$ if the starting superposition is 03 . 

Lemma 4.1. Let p a id> g be the probability of accepting the word wa'd'g. Then, 

\Pa'd'g ~ (PO + Pa + Pd + Pg)\ < 12(5. 



Proof. Pa'd'g is the sum of probabilities of accepting while reading kw, accepting while reading a' ', accepting 
while reading d! and accepting while reading g$. The first two probabilities are exactly po and p a . 

The probability of accepting while reading d! may differ from pd because the state of Q after reading 
Kwa' is V^/(-0) and the state used to define pa is ip\. However, these two probabilities differ by at most 4<5 
because 

II W) - Vill < IIK'(^i) "V'lll + UK'(V*)II < 26 

and the probability distributions resulting from observing V' a , (ip) and tpi can differ by at most twice the 
distance between superpositions (Lemma 2.1). 

Similarly, the distance between the state of Q after reading na! d! and ip3 is at most AS and this implies that 
the probability of accepting while reading g$ portion of nwa'd 1 ' g% differs from p g by at most 8(5. Therefore, 
the difference between p a >d' g and (p + p a + Pd + Pg) is at most 4(5 + 8<5 = 125. 

Similar bounds are true for probabilities of accepting wo' f'h, wc'e'i, wa'e'h, wb'd'i, wc' f'g. (We denote 
these probabilities pb'fh, Pc'e'i, Pa'e'h, Pb'd'i, Pc'fg-) By putting the bounds for p a >d>g, Pb'fh, Pc'e'i together, 
we get 

\{Pa'd'g + Pb'fh + Pc'e'i) - (3po + Pa + Pb + Pc + Pd + Pe + Pf + Pg + Ph + Pi)\ < 
\Pa'd'g - (PO + Pa + Pd + Pg)\ + \Pb'f'h - (PO + Pb +Pf + Ph)\ + \Pc'e'i - (PQ + Pc + Pe + Pi)\ < 36(5. 

Putting the bounds for p a ' e 'h, Pb'd'i, Pc'fg together gives 

\{Pa'e'h + Pb'd'i + Pc'fg) - {3p + Pa + Pb + Pc + Pd + Pe + Pf + Pg + Ph + Pi)\ < 36(5, 
\(Pa'd'g + Pb'fh + Pc'e'i) - (Pa'e'h + Pb'd'i + Pc> f g)\ < 725. 

However, each of p a 'd'gi Pb' f'hi Pc'e'i is the probability of accepting a word in L and must be at least 1/2 + e 
and each of p a 'e'hi Pb'd'i, Pc'fg is the probability of accepting a word not in L and must be at most 1/2 — e. 
Therefore, 



(Pa'd'g+ Pb'fh + Pc'e'i) ~ {Pa'e'h + Pb'd'i + Pc'fg) > 6e = 120(5. 



A contradiction. 



□ 



The existence of the "forbidden construction" of Theorem 4.1 does not imply the existence of any of 
previously shown "forbidden constructions" . 

This can be shown as follows. Consider the alphabet S = {a, 6, c, d, e, /, g, h, i} and languages of the form 
L x ,y,z = x(a\b\c)*y(d\e\f)*z where x 6 {a, b, c}, y 6 {d, e, /}, z S {g, h, i}. Our language L will be the union 
of languages L XtViZ corresponding to black squares in Figure nil 
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Fig. 11. The language L 



Theorem 4.2. The minimal automaton of L does not contain the "forbidden construction" of Theorem 2.1 



Proof. The minimal automaton of L has a structure similar to Figure [10|, with some more states. Similarly 
to Figure [UJ the states of the minimal automaton of L can be partitioned into 4 levels: 

1. The starting state (nothing read so far). 

2. The states after reading a, b or c. 

3. The states after reading a, b or c and d, e or /. 

4. The states after reading a, b or c, d, e or / and </, h or i. 



If we look for the "forbidden construction" of Theorem 2.1, then 52 and q$ should be in the 2 nd or 3 rd 
level. (q2 or q$ cannot be in the 1 st level because, after reading any letter in the starting state, M leaves it 
and never returns. Also, qi or q^ cannot be in the 4 th level because every state in it is "all-accepting" or 
"all-rejecting".) This leaves us with 3 possible cases. 

1. q2 and q$ are both states on the 2 nd level (after the automaton has read a, & or c). 

The sets of words that are accepted from q 2 and q$ correspond to black pieces in 3 x 3 squares in Figure 
O . For any two of these three squares, the black pieces in one of them are subset of the black pieces in 
the other. (And this means there is no words Zi, Z2 such that z\ gets accepted from qi but not from q% 
and zi gets accepted from q% and not from 02) 

2. g2 and q% are two of the 9 states on the 3 rd level (after reading one of a, b or c and one of d, e or /). 
The sets of words that lead to acceptance correspond to rows in Figure |ll]. One can easily see that any 
two of them are subsets of one another. 

3. One of 92 and 93 is on the 2 nd level and the other is on the 3 rd level. 

W. 1. o. g., assume that (72 is on the 2 nd level and q% is on the 3 rd level. Then, the word y that leads the 
automaton M from q\ to q% must contain one of letters d, e and /. However, reading d, e or / in the 
state <?2 would lead M to a state in the 3 rd level from which it cannot return to qi (and, therefore, the 



condition 6 of Theorem 2.1 is violated) 



In all 3 cases, we see that one of conditions of Theorem |2.1| is violated. Therefore, the minimal automaton 

□ 



M does not contain the "forbidden construction" of Theorem 2.1 



However, one can easily see that the minimal automaton of L contains the "forbidden construction" of 



Theorem [4. l|. (Just take go to be the starting state and make a, b, . . . , i of Theorem |4.l| equal to corresponding 



letters in the alphabet £.) This means that the existence of "forbid den construction" of Theorem 11 does 



not imply the existence of the "forbidden construction" of Theorem 2.1 



Theorem 4.1 can be generalized to any number of levels (cycles following one another) and any number 
of branchings at one level as long as every arc from one vertex to other is traversed the same number of 
times in paths leading to accepting states and in paths leading to rejecting states. 

A general "forbidden construction" is as follows. 

Level 1 of such a construction consists of a state q\ and some words an, ai2, 

Level 2 consists of the states 021 , 022 , • ■ ■ where the automaton goes if it reads one of words of Level 1 in 
a state in Level 1. We require that, if the automaton starts in one of states of Level 2 and reads any string 
consisting of words of Level 1 it can return to the same state reading some string consisting of these words. 
Level 2 also has some words 021, 022, 

Level 3 consists of the states q%\ , 932 , . . . where the automaton goes if it reads one of words of Level 2 in 
a state in Level 2. We require that, if the automaton starts in one of states of Level 3 and reads any string 
consisting of words of Level 2 it can return to the same state reading some string consisting of these words. 
Again, Level 3 also has some words 031, 032, 



Level n consists of the states q n i , 
n — 1 in a state in Level n — 1. 



Qn2, 



where the automaton goes if it reads one of words of Level 



Let us denote all different words in this construction as 01, 02, 03, ■ • ■ , a m . 

For a word a, and a level j we construct sets of states Bij and Dy . A state q in level j + 1 belongs to Bij 
if the word cti belongs to level j and M moves to q after reading <Zj in some state in level j. A state belongs 
to Dij if this state belongs to the Level n and it is reachable from B^ . 

Theorem 4.3. Assume that the minimal automaton M of a language L contains the "forbidden construc- 
tion" of the general form described above and, in this construction, for each D^ the number of accepting 
states is equal to the number of rejecting states. Then, L cannot be recognized by a 1-way QFA. 



Theorems 2.1 and 4.1 are special cases of this theorem (with 3 and 4 levels, respectively). 
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